Shellsort

Shellsort
Shellsort with gaps 23, 10, 4, 1 in action.
Class	Sorting algorithm
Data structure	Array
Worst case performance	depends on gap sequence
Best case performance	O(N)
Average case performance	depends on gap sequence
Worst case space complexity	O(1)

Shellsort, also known as Shell sort or Shell's method is an in-place comparison sort. It generalizes an exchanging sort, such as insertion or bubble sort, by allowing the comparison and exchange of elements that lie far apart. Its first version was published by Donald Shell in 1959.^[1]^[2] The running time of Shellsort is heavily dependent on the gap sequence it uses. For many practical variants, determining their time complexity remains an open problem.

1 Description
2 Pseudocode
3 Gap sequences
4 Computational complexity
5 Applications
6 See also
7 References
8 Bibliography
9 External links

Description

Shellsort is a multi-pass algorithm. Each pass is an insertion sort of the sequences consisting of every h-th element for a fixed gap h (also known as the increment). This is referred to as h-sorting.

An example run of Shellsort with gaps 5, 3 and 1 is shown below.

$\begin{array}{rcccccccccccc} &a_1&a_2&a_3&a_4&a_5&a_6&a_7&a_8&a_9&a_{10}&a_{11}&a_{12}\\ \hbox{input data:} & 62& 83& 18& 53& 07& 17& 95& 86& 47& 69& 25& 28\\ \hbox{after 5-sorting:} & 17& 28& 18& 47& 07& 25& 83& 86& 53& 69& 62& 95\\ \hbox{after 3-sorting:} & 17& 07& 18& 47& 28& 25& 69& 62& 53& 83& 86& 95\\ \hbox{after 1-sorting:} & 07& 17& 18& 25& 28& 47& 53& 62& 69& 83& 86& 95\\ \end{array}$

The first pass, 5-sorting, performs insertion sort on separate subarrays (a₁, a₆, a₁₁), (a₂, a₇, a₁₂), (a₃, a₈), (a₄, a₉), (a₅, a₁₀). For instance, it changes the subarray (a₁, a₆, a₁₁) from (62, 17, 25) to (17, 25, 62). The next pass, 3-sorting, performs insertion sort on the subarrays (a₁, a₄, a₇, a₁₀), (a₂, a₅, a₈, a₁₁), (a₃, a₆, a₉, a₁₂). The last pass, 1-sorting, is an ordinary insertion sort of the entire array (a₁,..., a₁₂).

As the example illustrates, the subarrays that Shellsort operates on are initially short; later they are longer but almost ordered. In both cases insertion sort works efficiently.

Shellsort is unstable: it may change the relative order of elements with equal values. It has "natural" behavior, in that it executes faster when the input is partially sorted.

Pseudocode

Using Marcin Ciura's gap sequence, with an inner insertion sort.

# Sort an array a[0...n-1].
gaps = [701, 301, 132, 57, 23, 10, 4, 1]
 
foreach (gap in gaps)
    # Do an insertion sort for each gap size.
    for (i = gap; i < n; i += 1)
        temp = a[i]
        for (j = i; j >= gap and a[j - gap] > temp; j -= gap)
            a[j] = a[j - gap]
        a[j] = temp

Gap sequences

Every gap sequence that contains 1 yields a correct sort; however, the properties of thus obtained versions of Shellsort may be very different.

The table below compares most proposed gap sequences published so far. Some of them have decreasing elements that depend on the size of the sorted array (N). Others are increasing infinite sequences, whose elements less than N should be used in reverse order.

General term (k ≥ 1)	Concrete gaps	Worst-case time complexity	Author and year of publication
$\lfloor N / 2^k \rfloor$	$\left\lfloor\frac{N}{2}\right\rfloor, \left\lfloor\frac{N}{4}\right\rfloor, \ldots, 1$	$\Theta(N^2)$ [when N=2^p]	Shell, 1959^[1]
$2 \lfloor N / 2^{k%2B1} \rfloor %2B 1$	$2 \left\lfloor\frac{N}{4}\right\rfloor %2B 1, \ldots, 3, 1$	$\Theta(N^{3/2})$	Frank & Lazarus, 1960^[3]
$2^k - 1$	$1, 3, 7, 15, 31, 63, \ldots$	$\Theta(N^{3/2})$	Hibbard, 1963^[4]
$2^k %2B 1$ , with 1 prepended	$1, 3, 5, 9, 17, 33, 65, \ldots$	$\Theta(N^{3/2})$	Papernov & Stasevich, 1965^[5]
successive numbers of the form $2^p 3^q$	$1, 2, 3, 4, 6, 8, 9, 12, \ldots$	$\Theta(N \log^2 N)$	Pratt, 1971^[6]
$(3^k - 1) / 2$ , not greater than $\lceil N / 3 \rceil$	$1, 4, 13, 40, 121, \ldots$	$\Theta(N^{3/2})$	Knuth, 1973^[7]
$\prod \limits_{\scriptscriptstyle 0\le q<r\atop \scriptscriptstyle q\neq(r^2%2Br)/2-k}a_q, \hbox{where}$ $r = \left\lfloor \sqrt{2k%2B\sqrt{2k}} \right\rfloor,$ $a_q=\min\{n\in\mathbb{N}\colon n\ge(5/2)^{q%2B1},$ $\forall p\colon0\le p<q\Rightarrow\gcd(a_p,n)=1\}$	$1, 3, 7, 21, 48, 112, \ldots$	$O(N e^\sqrt{8\ln(5/2)\ln N})$	Incerpi & Sedgewick, 1985^[8]
$4^k %2B 3\cdot2^{k-1} %2B 1$ , with 1 prepended	$1, 8, 23, 77, 281, \ldots$	$O(N^{4/3})$	Sedgewick, 1986^[9]
$9(4^{k-1}-2^{k-1})%2B1, 4^{k%2B1}-6\cdot2^k%2B1$	$1, 5, 19, 41, 109, \ldots$	$O(N^{4/3})$	Sedgewick, 1986^[9]
$h_k = \max\left\{\left\lfloor 5h_{k-1}/11 \right\rfloor, 1\right\}, h_0 = N$	$\left\lfloor \frac{5N}{11} \right\rfloor, \left\lfloor \frac{5}{11}\left\lfloor \frac{5N}{11} \right\rfloor\right\rfloor, \ldots, 1$	?	Gonnet & Baeza-Yates, 1991^[10]
$\left\lceil \frac{9^k-4^k}{5\cdot4^{k-1}} \right\rceil$	$1, 4, 9, 20, 46, 103, \ldots$	?	Tokuda, 1992^[11]
unknown	$1, 4, 10, 23, 57, 132, 301, 701$	?	Ciura, 2001^[12]

When the binary representation of N contains many consecutive zeroes, Shellsort using Shell's original gap sequence makes Θ(N²) comparisons in the worst case. For instance, this case occurs for N equal to a power of two when elements greater and smaller than the median occupy odd and even positions respectively, since they are compared only in the last pass.

Although it has higher complexity than the O(NlogN) that is optimal for comparison sorts, Pratt's version lends itself to sorting networks and has the same asymptotic gate complexity as Batcher's bitonic sorter.

Gonnet and Baeza-Yates observed that Shellsort makes the fewest comparisons on average when the ratios of successive gaps are roughly equal to 2.2.^[10] This is why their sequence with ratio 2.2 and Tokuda's sequence with ratio 2.25 prove efficient. However, it is not known why this is so. Sedgewick recommends to use gaps that have low greatest common divisors or are pairwise coprime.^[13]

With respect to the average number of comparisons, the best known gap sequences are 1, 4, 10, 23, 57, 132, 301, 701 and similar, with gaps found experimentally. Optimal gaps beyond 701 remain unknown, but good results can be obtained by extending the above sequence according to the recursive formula $h_k = \lfloor 2.25 h_{k-1} \rfloor$ .

Tokuda's sequence, defined by the simple formula $h_k = \lceil h'_k \rceil$ , where $h'_k = 2.25 h'_{k-1} %2B 1$ , $h'_1 = 1$ , can be recommended for practical applications.

Computational complexity

The following property holds: after h₂-sorting of any h₁-sorted array, the array remains h₁-sorted.^[14] Every h₁-sorted and h₂-sorted array is also (a₁h₁+a₂h₂)-sorted, for any nonnegative integers a₁ and a₂. The worst-case complexity of Shellsort is therefore connected with the Frobenius problem: for given integers h₁,..., h_n with gcd = 1, the Frobenius number g(h₁,..., h_n) is the greatest integer that cannot be represented as a₁h₁+ ... +a_nh_n with nonnegative integer a₁,..., a_n. Using known formulae for Frobenius numbers, we can determine the worst-case complexity of Shellsort for several classes of gap sequences. Proven results are shown in the above table.

With respect to the average number of operations, none of proven results concerns a practical gap sequence. For gaps that are powers of two, Espelid computed this average as $0.5349N\sqrt{N}-0.4387N-0.097\sqrt{N}%2BO(1)$ .^[15] Knuth determined the average complexity of sorting an N-element array with two gaps (h, 1) to be $2N^2/h %2B \sqrt{\pi N^3 h}$ .^[7] It follows that a two-pass Shellsort with h = Θ(N^1/3) makes on average O(N^5/3) comparisons. Yao found the average complexity of a three-pass Shellsort.^[16] His result was refined by Janson and Knuth:^[17] the average number of comparisons made during a Shellsort with three gaps (ch, cg, 1), where h and g are coprime, is $\frac{N^2}{4ch}%2BO(N)$ in the first pass, $\frac{1}{8g}\sqrt{\frac{\pi}{ch}}(h-1)N^{3/2}%2BO(hN)$ in the second pass and $\psi(h, g)N %2B \frac{1}{8}\sqrt{\frac{\pi}{c}}(c-1)N^{3/2}%2BO((c-1)gh^{1/2}N) %2B O(c^2g^3h^2)$ in the third pass. ψ(h, g) in the last formula is a complicated function asymptotically equal to $\sqrt{\frac{\pi h}{128}}g %2B O(g^{-1/2}h^{1/2}) %2B O(gh^{-1/2})$ . In particular, when h = Θ(N^7/15) and g = Θ(h^1/5), the average time of sorting is O(N^23/15).

Based on experiments, it is conjectured that Shellsort with Hibbard's and Knuth's gap sequences run in O(N^5/4) average time,^[7] and that Gonnet and Baeza-Yates's sequence requires on average 0.41NlnN(ln lnN+1/6) element moves.^[10] Approximations of the average number of operations formerly put forward for other sequences fail when sorted arrays contain millions of elements.

The graph below shows the average number of element comparisons in various variants of Shellsort, divided by the theoretical lower bound, i.e. log₂N!, where the sequence 1, 4, 10, 23, 57, 132, 301, 701 has been extended according to the formula $h_k = \lfloor2.25 h_{k-1}\rfloor$ .

Applying the theory of Kolmogorov complexity, Jiang, Li, and Vitányi proved the following lower bounds for the order of the average number of operations in an m-pass Shellsort: Ω(mN^1+1/m) when m≤log₂N and Ω(mN) when m>log₂N.^[18] Therefore Shellsort has prospects of running in an average time that asymptotically grows like NlogN only when using gap sequences whose number of gaps grows in proportion to the logarithm of the array size. It is, however, unknown whether Shellsort can reach this asymptotic order of average-case complexity, which is optimal for comparison sorts.

The worst-case complexity of any version of Shellsort is of higher order: Plaxton, Poonen, and Suel showed that it grows at least as rapidly as Ω(N(logN/log logN)²).^[19]

Applications

Shellsort is now rarely used in serious applications. It performs more operations and has higher cache miss ratio than quicksort. However, since it needs relatively short code and does not use the call stack, some implementations of the qsort function in the C standard library targeted at embedded systems use it instead of quicksort. Shellsort is, for example, used in the uClibc library.^[20] For similar reasons, an implementation of Shellsort is present in the Linux kernel.^[21]

Shellsort can also serve as a sub-algorithm of introspective sort, to sort short subarrays and to prevent a pathological slowdown when the recursion depth exceeds a given limit. This principle is employed, for instance, in the bzip2 compressor.^[22]

References

^ ^a ^b Shell, D.L. (1959). "A High-Speed Sorting Procedure". Communications of the ACM 2 (7): 30–32. doi:10.1145/368370.368387. http://penguin.ewu.edu/cscd300/Topic/AdvSorting/p30-shell.pdf.
^ Some older textbooks and references call this the "Shell-Metzner" sort after Marlene Metzner Norton, but according to Metzner, "I had nothing to do with the sort, and my name should never have been attached to it." See "Shell sort". National Institute of Standards and Technology. http://www.nist.gov/dads/HTML/shellsort.html. Retrieved 2007-07-17.
^ Frank, R.M.; Lazarus, R.B. (1960). "A High-Speed Sorting Procedure". Communications of the ACM 3 (1): 20–22. doi:10.1145/366947.366957.
^ Hibbard, Thomas N. (1963). "An Empirical Study of Minimal Storage Sorting". Communications of the ACM 6 (5): 206–213. doi:10.1145/366552.366557.
^ Papernov, A.A.; Stasevich, G.V. (1965). "A Method of Information Sorting in Computer Memories". Problems of Information Transmission 1 (3): 63–75. http://www.mathnet.ru/links/83f0a81df1ec06f76d3683c6cab7d143/ppi751.pdf.
^ Pratt, Vaughan Ronald (1979). Shellsort and Sorting Networks (Outstanding Dissertations in the Computer Sciences). Garland. ISBN 0-824-04406-1.
^ ^a ^b ^c Knuth, Donald E. (1997). "Shell's method". The Art of Computer Programming. Volume 3: Sorting and Searching (2nd ed.). Reading, Massachusetts: Addison-Wesley. pp. 83–95. ISBN 0-201-89685-0.
^ Incerpi, Janet; Sedgewick, Robert (1985). "Improved Upper Bounds on Shellsort". Journal of Computer and System Sciences 31 (2): 210–224.
^ ^a ^b Sedgewick, Robert (1986). "A New Upper Bound for Shellsort". Journal of Algorithms 7 (2): 159–173. doi:10.1016/0196-6774(86)90001-5.
^ ^a ^b ^c Gonnet, Gaston H.; Baeza-Yates, Ricardo (1991). "Shellsort". Handbook of Algorithms and Data Structures: In Pascal and C (2nd ed.). Reading, Massachusetts: Addison-Wesley. pp. 161–163. ISBN 0-201-41607-7.
^ Tokuda, Naoyuki (1992). "An Improved Shellsort". In van Leeuven, Jan. Proceedings of the IFIP 12th World Computer Congress on Algorithms, Software, Architecture. Amsterdam: North-Holland Publishing Co.. pp. 449–457. ISBN 0-444-89747-X.
^ Ciura, Marcin (2001). "Best Increments for the Average Case of Shellsort". In Freiwalds, Rusins. Proceedings of the 13th International Symposium on Fundamentals of Computation Theory. London: Springer-Verlag. pp. 106–117. ISBN 3-540-42487-3. http://sun.aei.polsl.pl/~mciura/publikacje/shellsort.pdf.
^ Sedgewick, Robert (1998). "Shellsort". Algorithms in C++, Parts 1-4: Fundamentals, Data Structure, Sorting, Searching. Reading, Massachusetts: Addison-Wesley. pp. 285–292. ISBN 0-201-35088-2.
^ Gale, David; Karp, Richard M. (1972). "A Phenomenon in the Theory of Sorting". Journal of Computer and System Sciences 6 (2): 103–115. doi:10.1016/S0022-0000(72)80016-3.
^ Espelid, Terje O. (1973). "Analysis of a Shellsort Algorithm". BIT Numerical Mathematics 13 (4): 394–400. doi:10.1007/BF01933401.
^ Yao, Andrew Chi-Chih (1980). "An Analysis of (h, k, 1)-Shellsort". Journal of Algorithms 1 (1): 14–50. doi:10.1016/0196-6774(80)90003-6.
^ Janson, Svante; Knuth, Donald E. (1997). "Shellsort with Three Increments". Random Structures and Algorithms 10 (1-2): 125–142. doi:10.1.1.54.9911. http://arxiv.org/abs/cs/9608105.
^ Jiang, Tao; Li, Ming; Vitányi, Paul (2000). "A Lower Bound on the Average-Case Complexity of Shellsort". Journal of the ACM 47 (5): 905–911. doi:10.1.1.6.6508. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.6.6508.
^ Plaxton, C. Greg; Poonen, Bjarne; Suel, Torsten (1992). "Improved Lower Bounds for Shellsort". Annual Symposium on Foundations of Computer Science 33: 226–235. doi:10.1.1.43.1393. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.43.1393.
^ Manuel Novoa III. "libc/stdlib/stdlib.c". http://codesearch.google.com/codesearch/p?hl=en#bLhHCxq0Bgw/downloads/uClibc-snapshot.tar.bz2%7C7hKHs61H6WA/uClibc/libc/stdlib/stdlib.c&l=787. Retrieved 2011-03-30.
^ "kernel/groups.c". http://codesearch.google.com/codesearch/p?hl=en#d7MjWbXYfN0/kernel/groups.c&sa=N&cd=8&ct=ri&l=105. Retrieved 2011-03-30.
^ Julian Seward. "bzip2/blocksort.c". http://codesearch.google.com/codesearch/p?hl=en#hfE6470xZHk/third_party/bzip2/blocksort.c&l=472. Retrieved 2011-03-30.

Bibliography

Knuth, Donald E. (1997). "Shell's method". The Art of Computer Programming. Volume 3: Sorting and Searching (2nd ed.). Reading, Massachusetts: Addison-Wesley. pp. 83–95. ISBN 0-201-89685-0.
Analysis of Shellsort and Related Algorithms, Robert Sedgewick, Fourth European Symposium on Algorithms, Barcelona, September 1996.

External links

Shellsort with gaps 5, 3, 1 as a Hungarian folk dance

Sorting algorithms

Theory	Computational complexity theory Big O notation Total order Lists Stability Comparison sort Adaptive sort Sorting network Integer sorting

Exchange sorts	Bubble sort Cocktail sort Odd–even sort Comb sort Gnome sort Quicksort Stooge sort Bogosort

Selection sorts	Selection sort Heapsort Smoothsort Cartesian tree sort Tournament sort Cycle sort

Insertion sorts	Insertion sort Shellsort Tree sort Library sort Patience sorting

Merge sorts	Merge sort Polyphase merge sort Strand sort

Distribution sorts	American flag sort Bead sort Bucket sort Burstsort Counting sort Pigeonhole sort Proxmap sort Radix sort Flashsort

Concurrent sorts	Bitonic sorter Batcher odd–even mergesort Pairwise sorting network

Hybrid sorts	Timsort Introsort Spreadsort UnShuffle sort JSort

Quantum sorts	Spaghetti sort

Other	Bogosort Topological sorting Pancake sorting